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Abstract 


An  sxtention  of  the  alpha-beta  tree  pruning  strategy  to  game 
trees  with  "probability"  nodes,  whose  values  are  defined  as  the 
(possibly  weighted)  average  of  their  successors'  values,  is 
developed.  These  ”*-mininax"  trees  pertain  to  games  involving 
chance  but  no  concealed  information. 

An  initial  left-to-right  depth-first 
algorithm  is  developed  and  shown  to  reduce  the  complexity  of  an 
exhaustive  search  strategy  by  10  to  35  percent.  An  improved 
algorithm  is  then  formulated  for  the  class  of  "regular"  "-minimax 
trees.  With  random  ordering  of  successor  nodes,  this  modified 
algorithm  Is  shown  to  reduce  search  by  more  than  50  percent. 
With  optimal  ordering,  it  is  shown  to  reduce  search  complexity  by 
an  order  of  magnitude.  In  particular,  the  mmtber  of  leaves 
examined  for  a  uniform  3-level  (2-ply)  tree  is  reduced  from  N**3 
to  1.7211  N**2.  For  a  branching  factor  of  20,  this  means  that  a 
second  ply  can  be  attained  by  looking  at  about  34  times  as  manv 
leaves,  a  substantial  improvement  over  the  factor  of  400  required 
by  the  "obvious"  algorithm.  Our  strategies  can  be  adapted,  as 
the  ordinary  alpha-beta  procedure  has  been,  to  take  advantage  of 
the  special  features  of  a  particular  game. 
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i.  Introduction 

Many  games  involving  chance  events,  such  as  the  roll  of  dice 
or  the  drawing  of  playing  cards,  can  be  modeled  by  introducing 
■probability"  nodes  into  standard  minimax  trees.  In  this  paper, 
we  shall  use  the  symbols  +  and  -  to  denote  maximizing  and 
minimizing  nodes,  respectively,  and  *  (pronounced  "star")  to 
denote  a  probability  node.  We  define  the  value  of  a  *  node  as 
the  weighted  average  of  the  values  of  its  successors,  which  may 
occur  with  differing  probabilities.  A  sample  "*-minimax"  tree, 
as  we  shall  call  trees  made  up  of  +,  -,  and  *  nodes,  appears  in 
Figure  1.  Backed-up  values  for  non-terminal  nodes  are  shown  in 
parentheses.  The  value  of  the  *  node,  whose  successors  have  been 
assumed  to  be  equally  likely,  has  been  computed  as  (2-4)  /  2  - 
-1. 

+  (3) 

/  \ 

/  \ 

/  \ 

-  (3)  *  (-1) 

/  \  /  \ 

/  \  /  \ 

4  3  -  (2)  -4 

/  \ 

/.  \ 

2  3 

Figure  1  -  A  Sample  *-Minimax  Tree 

In  this  caper  we  shall  develop  and  analyze  a  search  strategy 
for  *-minimax  trees.  Our  algorithm  reduces  to  the  familiar 
aloha-beta  procedure  (Knuth  and  Moore  (2U  for  degenerate  *- 
minimax  trees,  i.e.  those  with  only  +  and  -  nodes.  Readers 
unfamiliar  with  ordinary  minimax  trees  should  refer  to  Section  3 
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and  perhaps  consult  Nilsson  [11  or  any  of  [2-5],  To  facilitate 
analysis,  we  shall  assume  that  all  descendents  of  a  *  node  are 
equally  likely.  The  algorithm  we  present  can  be  extended,  in  a 
direct  way,  to  the  more  general  case. 

For  the  most  part,  *-minimax  trees  retain  the  properties  of 
ordinary  minimax  trees.  In  particular,  they  pertain  to  2-person, 
0-sum,  perfect  information  games.  By  "perfect  information"  we 
mean  that  neither  player  conceals  information  about  the  current 
state  of  the  game,  or  possible  future  states,  that  could  be 
useful  to  the  other  player.  Many  dice  qames  (e.g.  craps, 
backgammon,  and  monopoly  and  similar  board  games)  satisfy  these 
criteria,  as  do  some  card  games  (e.g.  casino  blackjack). 


Unlike  ordinary  minimax  trees,  where  +  nodes  always  lead  to 
-  nodes  and  vice  versa,  trees  for  *-minimax  games  exhibit  many 
forms.  For  instance,  the  top  portion  of  a  tree  for  casino  . 
-blackjack,  where  the  strategy  of  the  dealer  {"house")  is 
predetermined,  thus  eliminating  branches  beneath  -  nodes,  is 
given  (in  simplified  form)  in  Figure  2. 


Hit  /  \  Stand 

/  \ 

*  _ 

/  I  \  ! 

/  I  \  I 

loss  +  . . .  +  * 

/  I  \ 

/  I  \ 

loss  -  ...  win 

Figure  2  -  Portion  of  a  Casino  Blackjack  Tree 


Compare  the  structure  of  this  tree  fragment,  with  its  notable 
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absence  of  alternation  between  +  and  -  nodes,  with  the  backgammon 
tree  fragment  of  Figure  3. 


+ 

/  \ 

Double  /  \  Roll 

/  \ 

_  * 

Accept  /  \  Decline  /  I  \ 

/  \  /  I  \ 

*  win  +  +  ...  + 

Figure  3  -  Portion  of  a  Backgammon  Tree 

2.  The  ♦ -Minimax  Search  Problem 

Raving  defined  and  given  examples  of  "-minimax  trees,  we  now 
consider  the  question  of  searching  these  trees.  At  the  very 
least,  we  want  to  retain  the  aloha-beta  "cutoff"  power  of 
ordinary  minimax  trees.  However,  the  presence  of  *  nodes 
provides  opportunities  for  additional  forms  of  cutoffs.  Our 
strategy  is  based  on  the  fact  that  lower  and  upper  bounds  on  the 
value  of  a  *  node  can  be  derived  by  exploring  one  or  more  of  its 
children.  Our  search  algorithm  will  (indirectly)  associate  such 
lower  and  upper  bounds  with  each  *  node.  Since  alpha  and  beta 
values  will  have  been  passed  into  a  *  node,  we  can  discontinue 

search  below  it  if  the  lower  "'-bound  ever  'exceeds  beta,  or  if  the 

upper  "-bound  ever  becomes  less  than  alpha.  In  the  former  case, 
the  -  player  will  have  already  found  a  path  that  holds  his 
opponent  to  less  than  the  lower  limit  of  the  *  node  value.  In 

the  latter  case,  +  will  have  already  found  a  way  to  do  better 

than  the  upper  limit  of  the  *  node  value.  Thus,  optimal  play  by 
both  players  will  assure  that  the  *  node  in  question  is  never 
reached,  rendering  further  exploration  beneath  it  futile. 


As  an  example  of  a  possible  "*  cutoff",  suppose  the  (leaf} 
values  of  a  particular  tree  are  integers  between  0  and  10, 
inclusive,  and  that  a  *  node  with  4  equally  likely  successors  has 
had  2  of  its  successors  searched.  This  situation  is  shown  in 
Figure  4. 

*  < - 2  <=  val  <=  7 


5  3?? 

Figure  4  -  Interim  Bounds  on  a  *  Node 

Knowing  the  values  of  these  2  children,  we  can  say  that  the 
smallest  value  subsequent  search  can  assign  to  the  *  node  is 
(5  +  3  +  0  +  0)  /  4,  or  2.  Similarly,  the  greatest  possible 
value  of  the  *  node  is  (5  +  3  +  10  +  10)  /  4,  or  7.  Thus,  a 
cutoff  can  occur  if  the  alpha  value  passed  to  *  is  >*  7,  or  if 
the  beta  value  is  <*  2.  We  shall  formulate  a  search  strategy  to 
take  advantage  of  this  form  of  potential  cutoff.  In  addition, 
our  strategy  will  compute  new  alpha  and  beta  values  for  use  below 
*  nodes. 

3.  A  Strategy  for  Searching  *-Minimax  Trees 

Before  looking  at  ways  in  which  *  cutoffs  can  occur,  it  will 
be  useful  to  recall  the  circumstance  in  which  nodes  of  an 
ordinary  minimax  tree  can  be  pruned,  without  loss  of  accuracy. 
At  each  +  node,  the  +  player  will  choose  the  successor  with  the 
»  highest  value,  while  at  -  nodes  the  -  olayer  will  choose  the 
successor  with  the  smallest  value.  Consider  Figure  5. 
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+ 

a  /  \  b 

/  \ 

4 

x  /  \  y 

/  \ 

3  ? 

Figure  5  -  A  Conventional  Cutoff 

Having  determined  the  value  (either  terminal  or  backed-up)  of  the 
node  below  move  a  as  4,  move  b  is  "refuted"  by  move  x,  which 
establishes  that  the  value  of  move  b  is  no  more  than  3  (perhaps 
less).  Move  y  may  be  "cut  off"  since  its  value  (indicated  by 
"?")  has  no  bearing  on  either  the  value  of  the  root  of  the  tree 
or  on  deciding  the  best  move  from  the  root.  Since  the  node  with 
?  as  value  could  be  the  root  of  a  sizable  subtree,  the  searching 
of  many  thousands  of  nodes  may  have  been  eliminated. 

The  standard  method  of  having  a  search  algorithm  recognize 
opportunities  for  cutoffs  such  as  these  is  to  associate  so-called 
"alpha"  and  "beta"  values  with  each  node  n  of  the  tree.  The 
alpha  value  tells  how  well  +  can  do  if  node  n  is  encountered 
during  optimal  play  by  both  players.  Similarly,  the  beta  value 
tells  how  badly  -  can  make  +  do,  again  assuming  perfect  play 
reaches  node  n.  If  the  value  of  a  node  is  ever  determined  to 
exceed  its  beta  value,  or  to  be  less  than  its  alpha  value,  this 
must  mean  that  optimal  play  will  not  lead  to  the '  node  in 
question.  Therefore,  further  searching  is  pointless.  In  fact, 
subsequent  nodes  can  be  cut  off  if  a  successor  value  equals  the 
aloha  or  beta  value.  In  this  case,  one  of  the  players  will  have 
found  another  line  of  play,  with  at  least  as  good  a  value  for 


him,  which  has  already  been  searched.  If  a  plaver  has  two  or 
more  equally  good  moves,  it  doesn't  matter  which  one  is  made. 


Consider  now  the  partially  searched  *-minimax  tree  of  Figure 
6,  and  assume  that  leaf  values  range  from  -10  to  +10,  inclusive. 


+ 

/  \ 

4 

/  \ 

5  *  < -  <4  5> 

/  I  \ 

2  I  ... 

I 

_  < -  <0  10> 

/  \ 

3  *  < -  <0  3> 

/  I  \ 

-3  I  ... 

I 

+  < -  <-7  ,  10> 

Figure  6  -  A  Partially  Searched  *-Minimax  Tree 


The  alpha-beta  values  of  4  and  5  for  the  *  node  at  deDth  2 
indicate  that  if  the  value  of  this  node  is  determined  to  be  <=  4, 
or  >*  5,  search  beneath  it  can  be  discontinued.  Suppose  now 
that,  as  shown,  there  are  3  successors,  the  first  of  which  has 
been  searched  and  found  to  have  a  value  of  2.  If  -10  and  +10  are 
limits  on  leaf  values,  then  the  value  of  the  '*  node  lies  between 
(2  -  10  -  10)  /  3  and  (2  +  10  +  10)  /  3,  i.e.  between  -6  and 
7  1/3.  Since  -6  is  not  greater  than  5,  nor  is  7  1/3  less  than  4, 
we  must  continue  searching  children  of  the  *  node.  However, 
before  so  doing,  we  ask  ourselves,  what  values  of  the  lower-level 
*  node  to  be  searched  will  entail  a  cutoff  at  the  *  node?  Denoting 
this  value  by  V,  we  want  to  know  V  for  which 
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(2  ♦  V  +  10)  /  3  <-  4,  i.e.  V  <-  0 
or  (2  ♦  V  -  10)  /  3  >-  5,  i.e.  V  >-  23  . 

These  values  for  V  can  now  be  used  as  alpha-beta  values  for  the 
lower-level  node  to  be  searched.  Raving  assumed  10  as  an  uoper 
bound  on  game  values,  however,  we  will  use  10  rather  23  as  the 
beta  value.  Thus,  as  indicated  in  Pigure  6,  0  and  10  serve  as 
alpha-beta  values  for  the  -  node  at  depth  3. 

Suppose  now  that  we  search  the  first  descendent  of  this  - 
node  at  depth  3  and  find  a  value  of  3.  Since  3  is  not  less  than 
the  0  alpha  value,  we  continue  searching',  and  the  next  node  to  be 
searched  is  the  *  node  at  depth  4.  However,  since  3  is  less  than 
the  current  beta  value  of  10,  we  pass  down  3  as  the  new  beta 
value  for  the  *  node,  while  the  alpha  value  of  0  is  unchanged. 

Having  reached  the  *  node  at  depth  4,  we  can  discontinue 
searching  if  its  value  is  found  to  be  less  than  or  equal  to  0,  or 
greater  than  or  equal  to  3.  In  the  latter  ca^e,  -  will  have 
already  found  a  way  to  hold  his  opponent  to  3,  and  surely  won't 
give  ♦  the  chance  to  achieve  3  or  better  at  the  *  node.  In  the 
former  case,  a  cutoff  below  the  *  node  will  be  followed 
immediately  by  a  cutoff  at  the  parent  -  node,  which  will 
immediately  cause  a  cutoff  at  its  parent  *  node,  which  in  turn 
will  entail  a  cutoff  below  the  top  -  node.  This  possibility  for 
two  or  more  cutoffs  to  occur  without  intervening  leaf  searches  is 
without  counterpart  in  conventional  minimax  trees. 
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By  reasoning  as  above,  we  can  determine,  after  seeing  the  -3 
successor  of  the  *  node  at  depth  5,  that  the  *  node  value  lies 
between  -7  2/3  and  5  2/3,  and  that  -7  and  10  should  serve  as 
alpha-beta  values  for  the  +  node  to  be  searched  next. 


4.  An  Algorithm  for  Searching  "-Minimax  Trees 

We  now  formalize  the  reasoning  presented  above.  Let  L  and  U 
denote  lower  and  upper  bounds  on  all  possible  game  (leaf)  values. 
Let  VI,  V2,  VN  be  the  values  of  the  N  successors  of  a  * 
node,  whose  i-th  successor  is  about  to  be  searched.  After 
returning  from  the  i-th  node,  a  cutoff  will  occur  if 


(VI  + 


+  Vi-1)  +  Vi  +  U  *  (N  -  i) 


N 


<=  alpha 


(la) 


or  if 


(Vl  + 


+  Vi-l)  +  Vi  +  L  *  (N  -  i) 
N 


>=  beta 


(lb) 


i.  Letting  Ai  represent  the  alpha  value  for  the  i-th  successor,  we 
\  have 

?’■ 

Ai  «  N  *  alpha  -  (VI  +  ...  +  Vi-1)  -  U  *  (N  -  i)  (2a) 

where  "alpha"  denotes  the  alpha  value  of  the  present  *  node. 
Similarly,  letting  Bi  represent  the  new  beta  value,  we  have 

Bi  -  N  *  beta  -  (VI  +  ...  +  Vi-1)  -  L  *  (N  -  i)  (2b) 

I- 
* 

h 

A 

i ^ 

v 

§ 


where  "beta"  is  the  beta  of  the  *  node..  In  the  actual 
implementation,  we  will  want  to  assure  than  all  A's  are  >=  L  and 
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all  B's  ace  <■  0.  From  the  equations  above  we  see  that  uo-to- 
date  A  and  B  values  can  be  computed  efficiently  if  they  are 
initialized  as 

A1  -  N  *  (alpha  -  U)  +  U  (3a) 
B1  «  N  *  (beta  -  L)  ♦  L  (3b) 


and  updated  by 


(4a) 

(4b) 


An+1  -  An  +  U  -  Vn 
Bn+1  ■  Bn  +  L  -  Vn 


Note  that,  when  N  «  1,  A1  and  B1  take  on  the  alpha-beta  values 
themselves. 


From  the  above  formulation  we  derive  the  following  search 
procedure  for  *  nodes: 


Star 1 (board,  alpha,  beta) 

local  A,  B,  i,  v,  vsum,  AX,  BX,  sfl; 

determine  the  N  successors  si,  s2,  ...,  sN 
if  (N  «■  0) 

return (Term (board) ) j 

A  ■  N  *  (alpha  -  0)  ♦  Uj 
B  -  N  *  (beta  -  L)  +  L; 
vsum  «  0? 

for  (i«l;  i<«N;  i++)  { 

AX  «  max  (A,  L) ; 

BX  >  min  (B,  U) ; 
v  »  Bval (s [i] ,  AX,  BX) ; 
if  (v  <*  A) 

return (alpha) ; 
if  (v  >■  B) 

return (beta)  i 
vsum  *  vsum  ♦  v; 

A  ■  A  +  U  -  v; 

B  ■  B  +  L  -  v; 

} 

return (vsum  /  N) * 


I 
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This  code  makes  use  of  (1)  a  Te rm  procedure,  to  evaluate  terminal 
positions,  (2)  an  Eval  procedure  which,  depending  on  which  player 
is  to  move  next,  invokes  either  Max  or  Min?  and  (3)  a  procedure 
to  generate  the  successors  of  a  node. 

5.  A  "Better"  Solution  for  "Regular"  *-Minimax  Trees 

The  strategy  developed  above  assumes  that  each  successor  of 
a  *  node  could  be  either  a  -  or  a  node,  independent  of  its 
sister  nodes.  In  many  *-minimax  games,  however,  most  *  nodes 
fall  into  one  of  two  classes:  those  with  only  +  successors,  and 
those  with  onlv  -  successors.  In  these  games,  chance  events  are 
used  to  determine  legal  moves  (as  in  backgammon^ ,  or  the  outcome 
of  a  move  (as  in  blackjack) ,  or  both,  but  not  to  determine  who  is 
to  have  the  next  move.  (If  chance  is  used  to  decide  who  makes 
the  very  first  move,  this  one-time  event  is  unrelated  to  search 
matters.)  We  shall  refer  to  games  such  as  these  as  regular  *- 
minimax  games.  Trees  corresponding  to  these  games  alternate 
between  +  and  -  nodes,  as  do  ordinary  minimax  trees,  but  with  * 
nodes  interspersed.  Thus,  on  a  path  from  the  root  we  encounter 
node  types  of*,  +  ,  *,  -,  *,  +,  and  so  forth. 


For  the  newly-defined  class  of  regular  *-minimax  trees,  we 
can  devise  a  "better"  search  procedure,  to  be  called  Star2,  which 
is  later  shown  to  be  greatly  superior  to  the  Starl  procedure  it 
directly  extends.  The  algorithm  underlying  Starl  is  based  on  a 
strict  depth-first  control  strategy.  Thus,  if  X  and  Y  are 
successors  of  some  *  node,  we  cannot  examine  children  of  X, 
suspend  work  with  X  to  begin  searching  beneath  Y,  and  then  later 
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return  to  additional  nodes  beneath  X.  Consider  a  *  node  all  of 
whose  children  are  -  nodes.  In  this  situation,  the  left-to-right 
restriction  imposed  by  a  depth-first  control  strategy  has  two 
drawbacks.  First,  for  a  given  *  node,  it  forces  us  to  look  at 
all  N  leaves  beneath  all  but  the  last  -  node  searched  (unless 
some  leaf  takes  on  the  minimum  value  over  all  possible  leaves) . 
Second,  we  assumed  that  unprocessed  -  nodes  could  have  the 
maximum  possible  game  value.  Each  of  these  problems  is  answered 
by  the  following  modification  developed  below. 

If  a  *  node  being  examined  is  worse  than  a  previously 
searched  *  node,  a  preliminary  "probing*  of  just  one  child  of 
each  -  node  can  substantially  reduce  the  number  of  nodes  explored 
before  a  cutoff  occurs.  If  Wi  denotes  the  value  of  some  child  of 
the  i-th  -  node,  and  as  before  Vi  denotes  the  (true)  value  of  the 
i-th  -  node,  we  will  obtain  a  cutoff  below  the  *  node  if 


(VI  +  ...  ♦  Vi-1)  ♦  Vi  ♦  (Wi+1  + 

N 


+  WN) 

. .  <*  alpha 


(5) 


which  yields  an  Ai  value  of 

Ai  -  aloha  *  W  -  (VI  +  ...  +  Vi-1)  -  (wi+1  +  ...  ♦  NN)  (6) 

which  can  be  efficiently  computed  by  initializing  to 

AI  -  alpha  *  N  -  (W2  +  ...  +  WW)  (7) 

and  updating  by 

An+1  «  An  +  Wn+1  -  Vn 


(8) 


Since  all  W*s  are  <»  U,  the  values  computed  for  A  in  (6)  are 
never  leas  than  corresponding  values  in  (2a). 

If  the  *  node  being  searched  is  not  better  than  all 
previously  searched  alternatives,  so  that  a  cutoff  will  occur, 
then  unless  we  are  quite  unlucky  in  selecting  the  particular 
children  of  -  nodes  to  explore,  these  tiqhter  bounds  for  A  will 
allow  for  an  earlier  cutoff  than  the  formulas  given  earlier. 
Since  we  cannot  speak  with  confidence  about  how  large  the  true 
value  of  a  -  node  might  be  without  looking  at  all  its  successors, 
no  special  use  can  be  made  of  the  B  in  (2b)  during  the 
preliminary  probing  stage. 

We  formalize  these  ideas  in  the  following  orocedure.  In 
order  to  detect  possible  cutoffs  during  the  probing  phase,  and  to 
avoid  a  subscript  range  error  the  last  time  through  the  second 
loop,  the  calculations  specified  by  (7)  and  (8)  have  been 
"distributed"  into  disjoint  places  in  the  code  of  the  new 
procedure.  In  keeping  with  the  discussion  above,  this  code 
pertains  to  a  *  node  followed  by  -  nodes.  A  related  orocedure 
for  *  nodes  followed  by  +  nodes  will  also  be  needed. 


Star 2Min (board,  alpha,  beta) 
local  A,  B,  i,  v,  vsum,  AX,  BX,  s[],  w[1; 


determine  the  N  successors  si,  s2,  ...,  sN 
if  (N  «  0) 

return (Term (board) ) ; 

A  ■  N  *  (aloha  -  U)  +  U; 

B  «  N  *  (beta  -  L)  +  L; 

BX  «  min (B,  U) ; 

for  (i*l;  i<*N?  i++)  J 
AX  «  max (A,  L) j 
w(i)  *  Probe  (s(i],  AX,  BX)  ; 
if  (w(i1  <«  A) 
return (aloha) ; 

A  »  A  +  U  -  w[i)  ; 

} 

A  «  A  -  0; 
vsum  *  0; 

for  (i*t|  i<eN;  i++)  ( 

A  -  A  +  w[i]  ; 

AX  »  max (A,  L) ; 

BX  «  min  (B,  U)  j 
v  «  Min(s[i],  AX,  BX) ; 
if  (v  <■  A) 

return (alpha) ; 
if  (v  >»  B) 

return(beta) j 
vsum  ■  vsum  +  v; 

A  ■  A  -  v; 

B  «  B  +  L  -  v; 

I 

return (vsum  /  N) ; 


Here  we  have  referred  to  a  new  orocedure  Probe (x,v,z)  whose  job 
is  to  return  Term(x),  if  x  is  a  leaf,  otherwise  to  choose  some 
successor  s  of  x,  either  at  random  or  by  aooeal  to  a  static 
evaluation  function,  and  return  Min(s,y,z). 


In  the  event  that  preliminary  probing  fails  to  obtain  a 
cutoff,  the  modified  algorithm  given  above  reverts  to  the 
original  algorithm,  albeit  with  a  tighter  A  bound  and  therefore 
with  an  equal  or  better  opportunity  for  an  earlv  cutoff.  Rather 
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than  exhaustively  searching  -  nodes  one  by  one,  however,  a  second 
>|V  child  of  each  -  node  might  be  explored.  If  a  cutoff  has  still 
not  occurred,  a  third  successor  of  each  -  node  would  be 
%  considered,  and  so  forth.  This  "cyclic"  strategy  mav  require 
*  more  SDace  for  keeping  track  of  which  children  of  each  -  node 

k 

have  been  expanded,  but  the  even  tighter  bounds  available  after 
k  looking  at  only  2N  -  node  successors  may  more  than  make  ud  for 

this  in  certain  situations.  In  Section  7.4  we  consider  more 

V-,.' 

2-'  radical  modifications  of  Star2. 
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6.  Analysis  of  * -Minimax  Algorithms 

In  analyzing  the  efficiency  of  a  search  procedure  for  a 
class  of  game  trees,  one  begins  by  specifying  a  subclass  for 
study.  This  involves  deciding  on  (a)  the  overall  structure  of 
trees;  (b)  a  way  of  assigning  values  to  leaves;  and  (c)  the 
criterion  to  be  measured.  We  want  our  analysis  of  "-minimax 
search  to  resemble  that  for  ordinary  aloha-beta  wherever  Dossible 
(e.g.  [2-51).  Since  virtually  all  studv  of  aloha-beta  has  chosen 
to  count  the  number  of  leaves  encountered  as  the  efficiency 
measure,  we  will  do  so  as  well.  Furthermore,  most  analyses  of 
ordinary  alpha-beta  have  considered  so-called  "complete"  N-ary 
trees,  where  all  leaves  occur  at  a  fixed  depth  D  and  all  non¬ 
terminal  nodes  have  exactly  N  successors.  We  define  the  class  of 
♦-complete  N-ary  trees  by  inserting  a  *  node  above  each  node  of  a 
complete  N-ary  tree,  and  giving  these  *  nodes  N-l  additional 
♦successor  nodes  of  the  same  type.  These  trees  satisfy  the 
definition  of  "regular"  *-minimax  trees  as  given  in  the  orevious 


section.  Corresponding  to  a  complete  N-ary  tree  of  depth  n, 
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which  has  N  **  D  leaves,  is  a  ‘-complete  N-ary  tree  of  depth  2D 
having  N  “  2D  leaves.  The  leftmost  part  of  a  *-comolete  2-ply 
binary  tree  appears  in  Figure  7.  Incidentally,  it  makes  sense, 
for  these  trees,  to  speak  of  half-ply,  i.e.  chance  events  not  yet 
responded  to. 

* 

/  \ 

/  \ 

+  + 

/  \ 

/  \ 

*  * 

/  \ 

/  \ 

/  \ 

/  \ 

Figure  7  -  The  Leftmost  Portion  of  a  2-Ply  ‘-Complete  Binary  Tree 

We  will  investigate  the  efficiency  of  the  ‘-minimax 
algorithms  on  *-complete  N-ary  trees  of  depth  3,  since  they 
correspond  to  minimax  trees  (of  depth  2)  allowing  the  simolest 
cutoffs  of  standard  alpha-beta  (see  Figure  5) .  From  Figure  7  we 
can  see  that  no  cutoff  is  possible  at  the  topmost  level,  since 
the  root  is  a  *  node.  (During  actual  play,  the  chance  event  will 
have  occurred  by  the  time  we  are  ready  to  select  a  move,  so  only 
one  4-  successor  of  the  top  *  node  will  be  searched  anyway.)  The 
left  4-  node  however  permits  a  cutoff  below  its  second  successor. 
By  the  time  this  *  node  has  been  reached,  an  alpha  value  will  be 
available.  We  will  consider  this  critical  portion  of  the  tree, 
which  is  given  in  Figure  8. 
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/  \ 

/  \ 
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/  \ 

/  \ 

/  \  /V 

/  \  /  \ 

Figure  8  -  Opportunities  For  a  Cutoff 
in  a  "-Complete  2-Plv  Binary  Tree 


In  an  attempt  to  capture  the  sorts  of  leaf  dependencies  that 
have  been  observed  in  practice,  we  follow  Fuller,  Gaschnig,  and 

n 

Gillogly  [3]  by  assigning  distinct,  uniformly  spaced  values  to 

the  arcs  below  a  node,  and  defining  a  leaf  value  as  the  sum  of 

the  arc  values  on  the  path  to  it  from  the  root.  Since  we  want 
our  methods  to  apply  to  (perhaps  differently  shaoed)  trees  where 
leaves  occur  at  various  levels,  we  cannot  use  simplv  the  values 
of  1  through  N  as  arc  labels,  as  Fuller  et  al  [31  did.  To  enable 
all  successors  of  a  node  to  share  the  same  a  priori  orobabilitv 
of  being  best,  we  want  the  arc  value  from  a  +  or  -  node  to  its 

•best"  successor  to  be  0,  and  the  "average"  arc  value  out  of  a  * 

£ 

>  node  to  be  0.  Therefore,  to  the  arcs  of  -  nodes  we  assign  values 
of  0,  1,  N— 1 ,  and  to  the  arcs  of  +  nodes  we  assign  values  of 

0,  -1  - (N-l) .  Assuming  N  is  even,  we  assign  to  the  arcs  of 

*  nodes  the  values  of  -N/2,  ...,  -2,  -1,  1,  2,  ...,  N/2. 

Readers  familiar  with  studies  of  the  alpha-beta  procedure 
(Knuth  and  Moore  [2],  Fuller  et  al  f 3 1 r  Newborn  [4],  Baudet  [51) 

H  "will  appreciate  the  algebraic  complexity  involved  in  obtaining 
precise  closed-form  performance  figures  for  even  shallow  trees 
(e.g.  depth  3).  Accordingly,  the  following  analysis,  which 
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determines  both  asymptotic  complexity  and  exact  values  for 
various  branching  factors,  combines  simulation  with  analytic 
techniques  where  appropriate. 

6 . 1  Best-Case  Analysis 

We  first  establish  the  best  case  behavior  of  the  Starl  and 
Star 2  procedures  on  the  class  of  *-complete  trees  just  defined. 
We  will  derive  asymptotic  values  in  closed  form,  then  present 
exact  figures  for  various  branching  factors  arrived  at  by 
empirical  means. 

As  in  ordinary  alpha-beta  search,  the  first  successor  of  the 
+  node  must  be  fully  searched,  and  we  will  obtain  the  qreatest 
number  of  cutoffs  if  the  best  node  is  searched  first.  Since  this 
value,  which  we  have  arranged  to  be  0,  cannot  be  improved  upon, 
it  will  serve  as  alpha  value  for  all  the  remaining  *  nodes,  which 
have  backed-up  values  of  -1,  -2,  ...,  -(W-l)  and  can  be  searched 
in  any  order  without  affecting  search  efficiency.  Since  the  node 
with  -1  as  value  is  almost  best,  it  will  take  longest  to  search, 
while  the  -(N-l)  node  will  be  dispensed  with  most  quickly. 

We  will  find  the  number  of  leaves  searched  for  the 
particular  *  node  having  backed-up  value  of  w-N/2  (a  value  chosen 
to  simplify  the  algebra) .  From  this  result  we  add  up  values  as  w 
runs  from  -(N/2-1)  to  N/2-1  to  obtain  the  overall  search 
efficiency  under  the  ♦  node  in  question. 
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6.1.1  Best-Case  Analysis  of  the  Star  1  Procedure 

To  obtain  a  cutoff  below  a  *  node,  we  will  need  to  beqin 
exploring  a  j-th  descendent  of  it  for  which 

VI  +  V2  +  ...  +  Vj  +  rj  *  (N  -  i)  <=  0  (9) 

where  Vi  denotes  the  backed-up  value  of  the  i-th  descendent  of 
the  *  node  and  0  is  the  active  aloha  value.  The  values  of  the  - 
nodes  beneath  *  range  from  w-N  to  w,  excluding  w-N/2.  To 
guarantee  the  earliest  possible  cutoff,  we  will  want  to  look 
first  at  the  nodes  with  low  values.  If  j  >  N/2,  which  we  shall 
see  below  is  always  true,  then  VI  through  vj  will  take  on  the 
values  w-N  through  w-(N-j),  excluding  w-N/2.  Letting  k  =  N- j ,  we 
can  rewrite  the  equation  above  as 

[k  ♦  (k+l)  +  ...  +  NT  -  N/2  +  fk-N)  *  w  >=  k  *  U  (10) 

But  the  summation  is  easily  written  in  closed  form,  and  U  (the 
maximum  leaf  value)  is  N/2  +  (N-l) ,  or  3N/2  -  1.  After 
cancelling  the  N/2  terms,  we  have 

N*N  -  (k*k  -  k) 

—  ■  ■  +  (k  -  N)  *  w  .  >«  k  *  (3N/2  -  1)  (11) 

2 

which  can  be  written  as  a  quadratic  (in  k)  as 

k*k  ♦  (3N  -  2w  -  3)  *  k  -  (N*N  -  2Nw)  <=  0  '  (12) 

from  which  the  quadratic  formula  yields  k  as  the  floor  of 

t 

1  +  2w  -  3N  ♦  sqrt (13N*N  -  20Nw  -  18N  +  4w*w  +  12w  +  9) 


2 


(13) 
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which  for  large  N  becomes 

(-3N  ♦  2w  +  sqrt(13N‘N  -  20Nw  +  4w*w) )  /  2  (14) 

This  formula  reveals  reduction  in  searching  the  worst,  median, 
and  nearly-best  *  nodes  of  44.9%,  30.3%,  and  0%,  respectively. 
To  determine  the  total  number  of  nodes  pruned,  we  add  up  the 
above  formula  as  w  runs  from  -(N/2-1)  to  N/2-1.  For  large  N, 
this  sum  can  be  found  by  integration.  Once  again  ignoring 
lower-order  terms,  we  obtain  a  fractional  savings  of  0.2789.  We 
can  therefore  state  the  following: 

Result  It  The  asymptotic  best-case  behavior  of  algorithm  Starl  on 
the  +  node  of  a  ‘-complete  2-ply  N-ary  tree  is  to 
examine  approximately  0.7211  N“3  of  the  N“3  leaves 
beneath  it. 

Table  I  gives  exact  values  for  the  best  case  performance  of  the 
left-to-right  Starl  orocedure  for  various  values  of  N.  Note  the 
convergence  toward  the  region  of  72  percent,  as  predicted  by  the 
analysis  above. 

N  2  4  6  8  10  20  30  40 

Number  5  40  138  336  670  5560  18990  45320 

Percent  62.5  62.5  63.9  65.6  67.0  69.5  70.3  70.8 

Table  I  -  Best-Case  Leaf  Exploration  of  Starl 
for  Various  ‘-Complete  2-Ply  Trees 

6.1.2  Best-Case  Analysis  of  the  Star2  Procedure 

If  the  successors  of  -  nodes  are  optimally  ordered,  so  that 
their  first  successors  are  minimal,  then  if  a  cutoff  will  occur 
at  all,  it  will  occur  during  the  preliminary  processing  phase. 
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This  means  that  only  one  successor  per  -  node  will  need  to  be 
looked  at  by  Star2,  whereas  all  N  of  all  but  the  last  -  node  were 
examined  by  Starl.  As  revealed  above,  an  average  of  72.11 
percent  of  the  N  -  nodes  below  each  of  the  N  *  nodes,  i.e. 
0.7211  N‘N  nodes,  will  be  examined  for  all  but  the  best  *  node, 
whose  N*N  leaves  must  all  be  considered.  This  gives  us: 

Result  •  The  asymptotic  best-case  behavior  of  alqorithm  Star2  on 
the  +  node  of  a  ‘-complete  2-olv  N-ary  tree  is  to 
,  examine  approximately  1.7211  n“2  of  the  N“3  leaves 

beneath  it. 

This  result  is  encouraqing  because,  like  the  0(N“2)  best,  case 
alpha-beta  result  for  depth  3  trees  (Knuth  and  Moore  [3]),  it 
shows  that  a  wise  algorithm  can  hope  to  reduce  search  complexity 
by  a  factor  of  N.  For  the  most  part,  we  achieved  this  reduction 
without  significantly  increasing  the  conceptual  complexity  of  the 
^  algorithm,  its  overhead,  or  the  additional  space  needed  (which 
will  in  fact  be  only  N‘D  cells  for  an  N-ary  tree  of  depth  D) . 
t5.  Table  II  gives  exact  values  for  the  best  case  performance  of  the 
.Jji.  Star 2  procedure  for  various  values  of  N. 

N  2  4  6  8  10  .  20  30  40 

Number  5  25  58  105  166  677  1532  2732 

Percent  62.5  39.1  26.9  20.5  16.6  8.5  5.7  4.3 

Table  II  -  Best-Case  Leaf  Exploration  of  Star2 
for  Various  ‘-Complete  2-Ply  Trees 

Average-Case  Analysis 

Since  average-case  analysis  is  more  difficult  than  best-case 
analysis,  we  decided  to  investigate  the  exoected-case  performance 
of  the  Starl  and  Star2  procedures  mainly  by  empirical  means.  To 
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do  this,  we  coded  the  algorithms  in  the  "C"  lanauaae  to  be  run  on 
our  PDP-11/70  system.  Since  the  foreqoing  alqebraic  best-case 
analysis  ignores  lower-order  terms,  and  thus  cannot  yield 
reliable  values  for  small  values  of  N,  we  preceded  our  averaqe- 
case  experimentation  by  running  each  algorithm  on  an  ootimallv 
ordered  tree.  The  results  of  this  exact  analysis  were  given  in 
Tables  I  and  II.  Finally,  after  competing  our  empirical  studv, 
we  undertook  a  simplified  algebraic  analysis  of  average-case 
Starl  performance,  which  led  to  an  iterative  formula  which  will 
be  presented  after  describing  the  results  of  the  empirical  study. 

6.2.1  Empirical  Average-Case  Studv 

Using  the  UNIX  pseudo-random  number  generator,  we  qenerated 
and  gathered  statistics  on  1000  *-comolete  trees  for  each  of 
several  branching  factors.  In  generating  the  successors  of  a 
node,  all  N!  permutations  of  successor  arcs  were  assumed  to  be 
equally  likely.  We  did  this  because  (ai  it  is  simple  to 
implement;  (b)  it  corresponds  to  completelv  "uninformed"  static 
evaluation  capabilities,  thus  giving  a  conservative  picture  of 
what  to  expect  in  practice;  and  (c)  it  has  been  assumed  by 
previous  researchers,  thus  enabling  a  comparison  of  *-minimax 
trees  against  ordinary  minimax  trees.  In  the  event  that 
preliminary  probing  failed  to  result  in  a  cutoff,  the  N  leaves 
seen  were  counted  twice  if  subsequent  search  required  a  full 
search  of  the  -  node  above  them.  Thus,  the  results  obtained 
represent  a  conservative  estimate  of  average  case  analvsis,  which 
is  itself  a  conservative  estimate  of  how  well  one  can  expect  to 


do  in  practice. 
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6. 2. 1.1  Average-Case  Analysis  of  the  Start  Procedure 

Table  III  presents  the  average-case  results  for  the  initial 
lef t-to-right  Starl  procedure.  It  can  be  seen  that  the  average 
case  savings  appears  to  be  about  21  percent,  slightlv  less  than 
3/4  times  the  best-case  savings  of  28  percent. 

N  2  4  6  8  10  20  30  40 

Number  7.1  53.9  178  418  810  6389  21382  50425 

Percent  88.8  84.1  82.5  81.6  81.1  79.9  79.2  78.8 

Table  III  -  Average-Case  Leaf  Exploration  of  Starl 
for  Various  ‘-Complete  2-Ply  Trees 

6. 2. 1.2  Average-Case  Analysis  of  the  Star 2  Procedure 

We  have  seen  that  with  optimal  orderinq,  the  search 
complexity  of  Star 2  on  regular  ‘-complete  trees  can  be  reduced 
from  0(N“3)  to  0(N“2)  .  This  may  lead  us  to  expect  a 
significant  improvement  in  its  average-case  behavior  as  well. 
Table  IV  summarizes  the  results  of  Star2  performance.  In  • 

addition  to  simply  counting  leaf  explorations,  we  decided  to 
gather  information  on  how  many  ‘-node  cutoffs  were  made  during 
the  preliminary  probing  phases.  An  interesting  result  was  that 
roughly  half  the  *  nodes  for  which  a  cutoff  occurred  were  cut  off 
durinq  the  probing  chase.  Also,  we  see  that  for  a  branching 
factor  greater  than  about  20,  Star 2  looks  at  fewer  than  half  the 
leaves  explored  by  Starl. 
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N 

4 

6 

8 

10 

20 

30 

40 

Cutoffs 

Probing 

1.3 

2.0 

2.8 

3.5 

8.1 

12.6 

17.4 

Regular 

0.7 

1.5 

2.5 

3.5 

8.4 

13.4 

18.3 

Leaves  Seen 

Number 

48 

139 

293 

531 

3341 

10109 

22390 

Percent 

75.4 

64.5 

57.3 

53.1 

41.8 

37.4 

35.0 

Table  IV  -  Average-Case  Leaf  Exploration  of  Star2 
for  Various  ‘-Complete  2-Ply  Trees 


6. 2. 1.3  Discussion  of  Average-Case  Star2  Results 

In  the  ‘-complete  trees  we  have  been  considering,  where  each 
non-term inal  node  has  N  succe  • — ,  there  will  be  Nl  ways  of 
ordering  the  arcs  below  each  node.  In  both  the  Starl  and  Star2 
procedures,  searching  is  lef t-to-r ight  on  the  *  nodes  below  the 
root.  In  this  case,  each  *  node  better  than  all  its  predecessors 
must  be  fully  searched.  If  each  of  the  Nl  permutations  is 
equally  likely,  the  expected  number  of  such  fullv-searched  * 
nodes  is  the  "harmonic"  function,  given  by 

H(N)  «  1  +  1/2  +  ...  +  1/N 

This  formula  is  easily  verified  by  induction:  given  H(N-l), 
adding  an  N-th  node  worse  than  all  the  others  will  have  no  effect 
of  the  searching  of  the  previous  N-l  nodes,  while  the  new  node 
will  be  fully  searched  only  if  it  is  placed  first  in  the 
permutation,  which  happens  one  time  in  N.  Thus,  the  number  of 
leaf  nodes  searched  beneath  best-so-far  *  nodes  is  N*N*B(N).  If 
in  Table  IV  we  subtract  from  N  the  average  number  of  cutoffs  that 
occurred,  we  observe  perfect  agreement  with  H(N)  (within  the  0.1 
tolerance  due  to  rounding).  For  instance,  with  N*40,  we  see  that 
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an  average  of  5  *  nodes  were  found  suDerior  to  previously 

searched  *  nodes.  This  accounts  for  5*40*40,  or  8000,  of  the 
leaves  that  were  explored.  The  remaining  35  *  nodes  can  be  seen 
to  have  had  an  average  of  (22390-8000) /35 ,  or  411,  of  the  1600 
leaves  beneath  them  examined.  The  reader  will  recall  that  not 
all  the  additional  savings  of  Star  2  is  made  Dossible  by 
preliminary  cutoffs,  but  also  by  the  lower  values  assiqned  for 

(  the  hA s.  For  instance,  the  18.3  *  nodes  which  led  to  a  "regular" 
cutoff  did  so  after  searching  fewer  than  ( 22390-8000) /18. 3 ,  or 
786,  leaves.  This  fiqure  reDresents  about  2/3  the  average  number 
of  (50425-8000) /35 ,  or  1212,  leaves  examined  bv  Starl  for  the 
corresponding  *  nodes. 

Earlier  we  suggested  looking  at  2  leaves  during  preliminary 
probing,  rather  than  just  1,  below  each  -  node.  Experimentation 

’  t 

with  this  enhancement  to  Star2  revealed  an  average  exploration  of 
(19529-8000)/35,  or  329,  leaves  for  these  *  nodes  (a  20  percent 
reduction) . 


6.2.2  Algebraic  Average-Case  Analysis  of  Starl 

We  have  observed  that  the  expected  number  of  fully-searched 
*  nodes  is  given  by  the  harmonic  function  H(N),  so  that  the 


number  of  leaves  searched  beneath  them  is 


A  (N)  «  N  *  N  *  H(N) 

V 

We  will  now  determine  B(N) ,  the  number  of  leaves  searched  beneath 

1  * 

y  the  remaining  *  nodes,  which  we  add  to  A (N)  to  find  C(N),  the 
total  average  number  of  leaves  searched  in  an  N-arv  *-comolete 
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2-ply  tree. . 

Bach  of  the  N-B(N)  *  nodes  not  better  than  all  preceding  * 
nodes  will  be  rejected  when  its  value  is  determined  to  be  less 
than  the  value  of  the  best  *  node  seen  so  far.  Recall  that  in 
the  ^-complete  trees  we  are  considering,  *  nodes  have  backed-up 
values  of  0,  -1,  -(N-l).  Suooose  a  given  *  node  has  a  value 

of  n,  and  m  is  the  value  of  the  best  *  node  seen  so  far,  where 
n  <  m.  In  addition  to  the  *  node  of  value  m,  any  of  the  *  nodes 
with  a  value  less  than  m  (but  not  equal  to  n)  can  precede  the  n- 
valued  node.  The  number  of  ways  this  can  occur  is  given  by 

N-M-2  N-M-2 

D (M)  «  SUM  (  )  *  (s+1) 1  *  (N-S-2) l  (15) 

S»0  s 

where  M  »  lm|.  A  cutoff  will  occur  below  this  n-valued  *  node 
when  we  have  searched  a  j-th  successor  for  which 

VI  ♦  ...  +  Vj-1  +  Vj  +  U  *  (N  -  j) 


But  the  values  of  the  -  nodes  .below  the  n-valued  *  node  are  n- 
K/2,  ...,  n-1,  n+1,  ...,  n+N/2,  with  an  average  value  of  n.  For 
large  N,  j  becomes  large  as  well,  and  the  expected  value  of  the 
relative  difference  between  (VI  +  ...  +  Vj)  and  jn  approaches  0. 

Although  the  average  expected  value  of  a  function  is  not  in 
general  the  same  as  the  expected  value  of  the  averaqe  of  the 
function,  in  the  present  nearly-linear  situation,  it  gives  a  good 
approximation.  Thus,  using  the  ”jn”  value  derived  above  as  an 
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approximation,  we  obtain 

jn  +  (3N/2  -  1)  *  (N  -  j)  <■  mN  (17) 

whose  high-order  terms  give 

2jn  +  3N*N  -  3jN  <-  2mN  (18) 

which  reduces  to  give  a  value  for  j,  which  we  denote  by  J(m,n), 
of 


3N  -  2m 

J  (m , n )  - *  N  (19) 

3N  -  2n 


This  means  that  the  average  number  of  nodes  searched  beneath  * 
nodes  that  are  not  better  than  all  preceding  *  nodes  is 


N— 1  n-1 

SUM  SUM  D(m|  *  J(-m,-n)  *  N 
n*l  m=0 

B (N)  »  -  f 20) 

NJ 


By  supplying  the  expressions  for  the  D  an  J  functions, 
simplifying,  and  adding  A(N) ,  we  find  the  total  number  of  leaves 
searched  to  be 


C(N)  »  N  *  N  *  H(N)  + 

N-1  n-1  3N-2m  N-m-2  (s+1) I  *  (N-s-2) 1  N-m-2 

N  *  SUM  SUM  -  *  SUM  -  *  PROD  t  (21) 

n*l  m*0  3N-2n  s*0  N-1  t=N-m-l-s 

PROD  x 
X*N-S-1 


Table  V  gives  leaf  exploration  figures  for  soecific  values  of  N, 
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and  compares  the  experimentally  derived  results  against  them. 
Since  simplifications  were  made  in  deriving  B(W),  asymptotic 
agreement  is  not  guaranteed.  However,  it  can  be  seen  that  the 
agreement  is  reasonably  good  (less  than  2  percent  deviation)  for 
larger  values  of  N. 


N 

4 

6 

8 

10 

20 

30 

40 

Algebraic 

57 

168 

438 

846 

6557 

21805 

51174 

Empirical 

54 

178 

418 

810 

6389 

21382 

50425 

Percent  Difference 

5.6 

5.3 

4.6 

4.3 

2.6 

1.9 

1.5 

Table  V  -  Algebraic  Average-Case  Analysis  of  Starl 
Compared  with  Empirical  Findings 

7.  Remaining  Considerations 

We  complete  our  presentation  and  analysis  of  the  *-minimax 
search  problem,  and  our  solution  to  it,  by  devoting  brief 
attention  to  some  remaining  topics. 

7.1  The  Efficacy  of  Search 

In  developing  and  examining  algorithms  for  *-minimax  trees, 
we  have  assumed  that  search  proceeds  until  either  a  leaf  is 
encountered  or  an  allowable  form  of  cutoff  occurs.  In  this  case, 
the  move  selected  is  guaranteed  to  be  optimal,  i.e.  to  have  an 
expected  value  at  least  as  good  as  the  alternatives.  As  we  shall 
observe  in  Section  7.5,  however,  it  is  seldom  possible  to  carry 
out  a  complete  search,  and  so  in  practice  the  values  -  of  many 
non-terminal  nodes  must  be  determined  by  static  evaluation. 

When  complete  search  is  not  possible,  both  intuition  and 
empirical  observation  suggest  that  deeper  search  will  result  in 
better  play.  However,  recent  results  by  Nau  [61  show  that  for 
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ordinary  minimax  trees,  deeoer  search  can  in  some  situations 
result  in  makinq  worse  moves.  Despite  the  existence  of  such 
"pathological"  trees  (the  terminology  of  [611,  it  is  probably 
advantageous  for  most  games  of  interest,  including  the  "-minimax 
games  we  have  been  considering,  to  search  as  deeDly  as  possible 
(at  least  as  deeplv  as  current  technology  permits) . 

Although  play  is  likely  to  be  better  with  deeoer  search,  no 
general  statement  can  be  made  as  to  how  much  better  it  can  be 
expected  to  be.  For  many  games  of  chance,  strategy  appears  to  be 
much  more  important  than  search,  and  in  fact  the  currently  too- 
ranked  backgammon  proqr&m  (BKG)  does  not  carry  out  a  tree  search. 
Although  its  author  indicates  that  he  selected  backgammon  for 
study  because  he  "wanted  was  a  domain  where  it  is  possible 
to  ...  make  a  judgment  ...  without  having  to  worry 
about  ...  exhaustive  analysis"  (Berliner  (71),  he  states 
elsewhere  that  "the  deeoer  one  could  look,  the  better  (al  proqram 
would  play"  (8]. 

Perhaps  one  reason  search  has  received  so  little  attention 
for  games  such  as  backgammon  is  that  the  0(N  **  (2D-1)) 
complexity  of  the  "obvious"  search  strategv  appears  infeasible. 
For  example,  Berliner  [91  observes  that 

the  throw  of.  a  pair  of  dice  can  produce  21  different 
results,  and  each  such  throw  can  be  played  about  20 
different  wavs  in  the  average  position.  Thus  a  look¬ 
ahead  would  have  to  acquiesce  to  a  branching  factor  of 
about  400  for  each  Dly  of  look-ahead;  an  exoonential 
growth  rate  than  could  not  be  tolerated  for  verv  lonq. 

In  this  paper,  however,  we  have  presented  an  algorithm  which 
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reduces  this  branching  factor  of  400  to  677/20,  or  about  34. 
Since  we  have  assumed  equally  likely  chance  outcomes  (however, 
see  Section  7.3)  and  made  other  pedagogical  assumptions,  this 
should  be  treated  as  onlv  an  approximate  figure  for  a  particular 
game  such  as  backgammon. 

7.2  *n  Pnusual  Form  of  Cutoff 

Knuth  and  Moore  (21  have  shown  that  whereas  the  alpha-beta 
algorithm  for  minimax  trees  is  more  powerful  than  the  more 
obvious  branch-and-bound  strategy,  there  is  no  uniformly  stronger 
method.  This  assumes,  however,  that  we  must  determine  the 
precise  value  of  the  root,  not  just  the  best  move.  Thus,  if  the 
root  node  of  a  minimax  tree  has  H  successors,  and  the  first  N-l 
of  them  have  been  searched  and  all  found  to  have  the  lowest 
possible  game  value  (e.g.  "forced  loss”),  alpha-beta  will  still 
search  the  remaining  node,  even  though  this  node  is  known  to  be 
at  least  as  good  as  any  of  of  the  alternativesl  In  the 
degenerate  case,  this  would  mean  searching  a  tree  with  only  one 
branch  (denoting  a  "forced”  move)  from  the  root.  Heedless  to 
say,  existing  game-playing  proqrams  typically  respond  without 
search  in  these  situations. 

When  searching  the  last  successor  of  the  root  of  a  *-minimax 
tree,  a  stronger  form  of  cutoff  can  be  made.  In  particular,  we 
can  discontinue  search,  knowing  that  the  rightmost  node  is 
strictly  better  than  the  alternatives,  even  though  we  may  not 
know  its  exact  value.  This  is  because  the  value  of  a  *  node  is 
partially  determined  by  the  value  of  each  of  its  successors, 
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while  a  -  node  is  fully  determined  by  one  of  its  successors.  We 
implement  this  form  of  cutoff  by  discontinuing  search  beneath  the 
rightmost  *  node  when  its  lower  ‘-value  exceeds  the  aloha  value 
(rather  than  beta)  passed  into  it.  Being  able  to  cut  off  below 
the  node  corresponding  to  the  best  move,  without  knowing  its 
exact  value,  can  be  important  in  reducing  search  time  for  *- 
minimax  trees,  especially  with  a  narrow  branching  factor  below  + 
and  -  nodes  (e.g.  in  casino  blackjack,  with  a  branching  factor  of 
2  beneath  +  nodes) . 

7.3  Differing  Probabilities  Below  a  *  Node 

Neither  algorithm  presented  above  considers  the  situation 
where  not  all  outcomes  of  the  chance  event  are  equally  likelv. 
If  Pi  denotes  the  probability  with  which  the  i-th  successor  of  a 
node  occurs,  then  the  left  side  of  (la)  is  replaced  bv 

(P1V1  +  ...  +  Pi-lVi-1)  +  PiVi  +  UM1  -  PI  -  ...  -  Pi)  (22) 
and  (2a),  (3a),  and  (4a)  are  modified  accordingly. 

In  searching  ordinary  minimax  trees,  the  static  evaluation 
function,  or  a  similar  "plausible  move  generator",  is  often  used 
to  determine  the  order  in  which  to  consider  successors  of  a  node. 
When  the  probabilities  of  outcomes  differ  in  ‘-minimax  trees,  a 
potentially  useful  strategy  is  to  examine  more  likelv  successors 
first,  since  their  values  will  more  strongly  influence  the  *  node 
^value.  However,  one  must  weigh  against  this  the  likelihood  of  a 
useful  (i.e.  extreme)  value,  and  also  the  probable  number  of 
nodes  to  be  pruned  (i.e.  below  sister  nodes)  if  a  cutoff  does 
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occur.  In  *-minimax  trees,  where  cutoffs  are  harder  to  come  by, 
the  typical  tradeoff  between  the  likelihood  and  benefit  of  a 
cutoff  is  compounded  in  the  case  of  differing  probabilities. 
Decisions  as  to  which  combination  of  strategies  to  adopt  are  best 
made  by  considering  the  idiosyncrasies  of  the  particular  game 
under  consideration. 

7.4  Additional  Modifications 

An  interesting  modification  to  the  Starl  procedure,  which  we 
have  not  pursued  in  this  paper,  can  be  derived  by  relaxing  the 
left-to-right  restriction  on  *  nodes,  rather  than  simplv  on  the  - 
nodes  beneath  them  as  we  did  to  develop  Star 2.  In  particular, 
one  or  several  -  nodes  beneath  each  *  node  would  be  examined  in 
an  attempt  to  decide  which  *  nodes  appear  best.  Since  cutoffs 
below  *  nodes  occur  when  the  *  node  is  found  to  be  worse  than 
some  preceding  *  node,  it  is  useful  to  find  the  best  *  node,  or  a 
good  one,  before  carrying  out  exhaustive  or  nearly  exhaustive 
search  beneath  inferior  *  nodes. 

Preliminary  experimental  results  indicate  that  for  leaf 
dependencies  of  the  sort  defined  in  Section  6,  and  for  a 
branching  factor  of  40,  average-case  search  complexity  is  reduced 
from  35  percent  to  30  percent  when  the  best  *  node  is  considered 
first,  and  from  35  percent  to  31.4  percent  when  the  third-best 
(of  the  40)  *  node  is  seen  first.  When  the  best  *  node  is 
considered  first  and  2-node  cycling  is  performed  beneath  *  nodes, 
average-case  search  complexity  is  reduced  to  26.3  percent. 


7.5  Incorporating  *-Minimax  Search  Into  a  Complete  Game  Program 
In  programming  actual  minimax  games,  adjustments  are  often 
made  to  a  oure  alpha-beta  search  because  of  the  overwhelming  size 
of  most  search  trees.  In  particular,  a  static  evaluation 

function  is  generally  used  to  rank  successor  nodes  in  what 

appears  (before  searching)  to  be  best-to-worst  order,  hoping  to 
assure  early  cutoffs;  a  depth  bound  is  often  maintained  in  some 
form  to  preclude  searching  prohibitively  deep  nodes;  forward 
pruning  is  performed,  meaning  that  some  nodes  which  look 

unpromising  are  not  searched  at  all;  a  transposition  table  is 
maintained  to  avoid  searching  the  same  position  more  than  once  if 

it  appears  in  several  places  ("transpositions")  in  the  search 

tree;  and  so  forth.  In  practice,  we  would  expect  such 


modifications  to  be  made  to  the  *-minimax  procedures  as  well, 
although  the  underlying  algorithms  need  not  be  changed. 
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